ASYMPTOTIC THEORY FOR SIEVE ESTIMATORS IN
SEMIMARTINGALE REGRESSION MODELS


by

Ian W. McKeague


ABSTRACT

This paper studies the estimation of functions $\alpha_1, \ldots, \alpha_p$ describing the
temporal influence of $p$ covariate processes in a regression model for
semimartingales. McKeague (1986) introduced sieve estimators for $\alpha_1, \ldots, \alpha_p$ and
established consistency in $L^2$-norm. In the present paper the asymptotic
distribution theory for the integrated sieve estimators is developed. Smoothed
sieve estimators are shown to be pointwise consistent and rates of convergence
are provided.

1. Introduction.


This  work  is  a  sequel  to  McKeague  (1986)  in  which  statistical  estimation  for 
a  nonparametric  regression  model  for  semimartingales  was  introduced.  The  model 
is  given  by 

$$ X(t) = X(0) + \int_0^t \lambda(s)\,ds + M(t), \quad t \in [0,1], \tag{1.1} $$

where $M$ is a square integrable martingale and

$$ \lambda(s) = \sum_{j=1}^{p} \alpha_j(s) Y_j(s). \tag{1.2} $$

Here $\alpha_1, \ldots, \alpha_p$ are deterministic functions of time and $Y_1, \ldots, Y_p$ are
predictable covariate processes. Grenander's method of sieves was used to obtain
estimators, denoted $\hat\alpha_j^{(n)}$, of $\alpha_j$, $j = 1, \ldots, p$, based on $n$ replicates of $X$
and its covariates. These estimators were shown to be consistent in $L^2$-norm as
$n \to \infty$.

Our aim in this paper is to obtain some results on the asymptotic distributions
and rates of convergence of the sieve estimators. As in density estimation, a
satisfactory distribution theory is possible only for the integrated estimator
given by

$$ \hat A_j^{(n)}(t) = \int_0^t \hat\alpha_j^{(n)}(s)\,ds, \tag{1.3} $$

where $1 \le j \le p$. Also, to obtain rate of convergence results we need to look at a
smoothed sieve estimator

$$ \tilde\alpha_j^{(n)}(t) = \frac{1}{b_n} \int_0^1 K\!\left(\frac{t-s}{b_n}\right) \hat\alpha_j^{(n)}(s)\,ds, \tag{1.4} $$

where $K$ is a function with integral 1, called the kernel function, and $b_n > 0$
is a bandwidth parameter. Our results make it possible to test hypotheses
concerning $\alpha_1, \ldots, \alpha_p$ and to construct confidence bands for

$$ A_j(t) = \int_0^t \alpha_j(s)\,ds, \quad j = 1, \ldots, p. \tag{1.5} $$


The model (1.1) contains a number of important special cases. Grenander
(1981), Ibragimov and Khasminskii (1981) and Geman and Hwang (1982) have studied
the case where $p = 1$, $Y_1 \equiv 1$ and $M$ is a Wiener process. Nguyen and Pham (1982)
have treated the linear diffusion process with $p = 1$. Aalen (1980) has studied
the point process case for general $p \ge 1$. Aalen provided estimators of $A_1, \ldots, A_p$
but it has not been possible to obtain consistency or asymptotic distribution
results for these estimators except when $p = 1$. The importance of the point process
version of (1.1) is that it provides an alternative to the regression model of
Cox (1972) for the analysis of censored survival data. A practical example might
arise in which $\lambda(t)$ is the hazard rate for the incidence of cancer in a subject
who has been exposed to $p$ carcinogens, where $Y_j(t)$ is the cumulative exposure to
the $j$th carcinogen and $\alpha_j(t)$ represents the relative hazard rate of the $j$th
carcinogen at age $t$.


Our asymptotic distribution results are given in section 2. In section 3
we state results on the rate of convergence of the smoothed sieve estimators.
All proofs are contained in section 4.


2.  Asymptotic  distributions. 


Let $(X_i, M_i, Y_{ij}, j = 1, \ldots, p)$, $i = 1, \ldots, n$, denote $n$ independent copies
of the generic processes $X$, $M$ and $Y_j$, $j = 1, \ldots, p$, which satisfy the model (1.1).
For each $j = 1, \ldots, p$ let $(\phi_{jr}, r \ge 1)$ be a complete orthonormal sequence in
$L^2[0,1]$. The sieve estimator $\hat\alpha_j^{(n)}$ for $\alpha_j$, which was introduced by McKeague
(1986), is defined as follows. Let

$$ \hat\alpha_j^{(n)}(t) = \sum_{r=1}^{d_n} \hat a_{jr}^{(n)} \phi_{jr}(t), \tag{2.1} $$

where $(d_n)$ is an increasing sequence of positive integers and $\hat a_{jr}^{(n)}$ is the $jr$
element in the $p \times d_n$ matrix $\hat a^{(n)}$ defined by

$$ \mathrm{vec}(\hat a^{(n)}) = A^{(n)-1}\, \mathrm{vec}(B^{(n)}). \tag{2.2} $$

Here vec is an operator which takes a matrix and places the elements in
lexicographical order to form a long column vector, $B^{(n)}$ is a $p \times d_n$ matrix given by

$$ B_{jr}^{(n)} = n^{-1} \sum_{i=1}^{n} \int_0^1 \phi_{jr}(t) Y_{ij}(t)\,dX_i(t), $$

and $A^{(n)}$ is a $pd_n \times pd_n$ matrix partitioned into $p^2$ submatrices $A_{jk}^{(n)}$ of order
$d_n \times d_n$ with

$$ A_{jkr\ell}^{(n)} = n^{-1} \sum_{i=1}^{n} \int_0^1 \phi_{jr}(t) \phi_{k\ell}(t) Y_{ij}(t) Y_{ik}(t)\,dt. $$

In equation (2.2), $A^{(n)-1}$ is a generalized inverse of $A^{(n)}$ whose choice does
not affect the asymptotic behavior of $\hat\alpha_j^{(n)}$.
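As a concrete illustration of (2.1) and (2.2), the following minimal Python sketch computes the coefficient matrix on a discretised time grid. It is a sketch under stated assumptions, not the implementation used in the paper. The names `phi`, `solve` and `sieve_fit` are hypothetical, the Fourier-type basis is one possible choice of $(\phi_{jr})$, integrals are replaced by midpoint sums over $m$ grid cells, $B^{(n)}$ and $A^{(n)}$ are taken to be the empirical least squares quantities $n^{-1}\sum_i \int \phi_{jr} Y_{ij}\,dX_i$ and $n^{-1}\sum_i \int \phi_{jr}\phi_{k\ell} Y_{ij} Y_{ik}\,dt$, and $A^{(n)}$ is assumed invertible so that an ordinary linear solve can stand in for the generalized inverse. The toy data are noise free (the martingale part is set to zero), so the fit should return the coefficients used to generate them.

```python
import math

def phi(r, t):
    """Fourier-type basis function phi_r on [0, 1], r = 1, 2, ... (illustrative choice)."""
    if r == 1:
        return 1.0
    if r % 2 == 0:
        return math.sqrt(2.0) * math.cos(math.pi * r * t)
    return math.sqrt(2.0) * math.sin(math.pi * (r - 1) * t)

def solve(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    aug = [a[i][:] + [b[i]] for i in range(n)]
    for c in range(n):
        piv = max(range(c, n), key=lambda i: abs(aug[i][c]))
        aug[c], aug[piv] = aug[piv], aug[c]
        for i in range(n):
            if i != c:
                f = aug[i][c] / aug[c][c]
                for k in range(c, n + 1):
                    aug[i][k] -= f * aug[c][k]
    return [aug[i][n] / aug[i][i] for i in range(n)]

def sieve_fit(dX, Y, d):
    """Return the p x d coefficient matrix of the sieve estimator.

    dX[i][s] is the increment of X_i over grid cell s and Y[i][s][j] is the
    value of covariate j for replicate i on that cell (assumed data layout).
    """
    n, m, p = len(dX), len(dX[0]), len(Y[0][0])
    basis = [[phi(r, (s + 0.5) / m) for s in range(m)] for r in range(1, d + 1)]
    size = p * d
    A = [[0.0] * size for _ in range(size)]
    B = [0.0] * size
    for i in range(n):
        for s in range(m):
            # w holds phi_jr(t) * Y_ij(t) in lexicographic (j, r) order
            w = [Y[i][s][j] * basis[r][s] for j in range(p) for r in range(d)]
            for u in range(size):
                B[u] += w[u] * dX[i][s] / n
                for v in range(size):
                    A[u][v] += w[u] * w[v] / (n * m)
    a = solve(A, B)
    return [a[j * d:(j + 1) * d] for j in range(p)]

# Toy data: p = 2, alpha_1 = 1 + 0.5 * phi_2, alpha_2 = 2, no noise.
n, m, d = 6, 200, 3
Y = [[[1.0, float(i % 3)] for _ in range(m)] for i in range(n)]
dX = [[((1.0 + 0.5 * phi(2, (s + 0.5) / m)) * Y[i][s][0] + 2.0 * Y[i][s][1]) / m
       for s in range(m)] for i in range(n)]
coef = sieve_fit(dX, Y, d)
```

With real data the increments `dX` would contain the martingale noise, and `coef` would then only approximate the true coefficients.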


Define measures $\mu_j$, $j = 1, \ldots, p$, by $d\mu_j(t) = EY_j^2(t)\,dt$. The projections of
$\alpha_j$ onto $\mathrm{span}\{\phi_{jr}, r = 1, \ldots, d_n\}$ in $L^2([0,1], d\mu_j)$ and $L^2([0,1], dt)$ are
denoted $\alpha_j^{(n)}$ and $\alpha_j^{\{n\}}$ respectively.

Write $A_j^{(n)}(t) = \int_0^t \alpha_j^{(n)}(s)\,ds$, $A_j^{\{n\}}(t) = \int_0^t \alpha_j^{\{n\}}(s)\,ds$. A function $f$ is said to be
Lipschitz of order $\gamma$, where $0 < \gamma \le 1$, if there is a constant $C$ such that for all
$s, t$ in the domain of $f$, $|f(t) - f(s)| \le C |t - s|^{\gamma}$. If $f$ is Lipschitz of order
1 we simply say it is Lipschitz. The predictable quadratic variation of $M$ is
denoted $\langle M \rangle$.

The following assumptions were used in McKeague (1986) to obtain consistency
of $\hat\alpha_j^{(n)}$ in $L^2$-norm.

(A1) $\int_0^1 \alpha_j^2(t)\,dt < \infty$ for $j = 1, \ldots, p$.

(A2) $\sup_{t \in [0,1]} EY_j^4(t) < \infty$ for $j = 1, \ldots, p$.


(A5) The function
$$ v_j(t) = E\left[ \int_0^t Y_j^2(s)\,d\langle M \rangle_s \right], \quad t \in [0,1], $$
is Lipschitz, for $j = 1, \ldots, p$.

Additional assumptions needed for our various weak convergence results are
now stated.

(S1) $d_n \int_0^1 [\alpha_j(t) - \alpha_j^{\{n\}}(t)]^2\,dt \to 0$, as $n \to \infty$.

(S2) $n \int_0^1 [\alpha_j(t) - \alpha_j^{\{n\}}(t)]^2\,dt \to 0$, as $n \to \infty$.

(B1)
$$ \frac{\sup_{t \in [0,1]} |EY_j(t) Y_k(t)|}{[\inf_{t \in [0,1]} EY_j^2(t)]^{1/2}\, [\inf_{t \in [0,1]} EY_k^2(t)]^{1/2}} < \frac{1}{p-1} $$
for all $1 \le j < k \le p$, applicable for $p \ge 2$.

(B2) $\langle M \rangle$ is absolutely continuous and
$$ \sup_{t \in [0,1]} E\left[ Y_j^4(t) \left( \frac{d\langle M \rangle_t}{dt} \right)^2 \right] < \infty, \quad \text{for } j = 1, \ldots, p. $$

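Our first result deals with weak convergence of finite dimensional distributions.

Theorem 2.1. Suppose that (A1)-(A5), (S1) hold and $d_n \to \infty$, $d_n = o(n^{1/2})$. Then
for $h_j \in L^2[0,1]$, $j = 1, \ldots, p$,

$$ \left( \sqrt{n} \int_0^1 h_j(t) [\hat\alpha_j^{(n)}(t) - \alpha_j^{(n)}(t)]\,dt \right)_{j=1}^{p} \xrightarrow{\mathcal{D}} N(0, \Sigma) $$

where $\Sigma = (\sigma_{jk})$ is the $p \times p$ matrix defined by

$$ \sigma_{jk} = E \int_0^1 H_j(t) H_k(t) Y_j(t) Y_k(t)\,d\langle M \rangle_t, \tag{2.3} $$

$H = (H_1, \ldots, H_p)'$, $h = (h_1, \ldots, h_p)'$,

$$ H(t) = L(t) h(t), \quad L(t) = [K(t)]^{-1}, \tag{2.4} $$

and $K(t)$ is the $p \times p$ matrix with components

$$ K_{jk}(t) = E[Y_j(t) Y_k(t)]. \tag{2.5} $$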


Corollary 2.2. Under the conditions of Theorem 2.1

$$ \sqrt{n}\,(\hat A_j^{(n)} - A_j^{(n)}) \xrightarrow{fd} m_j, $$

where $\xrightarrow{fd}$ denotes convergence of finite dimensional distributions and $m_j$ is a
continuous Gaussian martingale with $m_j(0) = 0$,

$$ \mathrm{Cov}(m_j(s), m_j(t)) = E \int_0^{s \wedge t} L_{jj}^2(u) Y_j^2(u)\,d\langle M \rangle_u. \tag{2.6} $$

Corollary 2.3. Under the conditions of Theorem 2.1 but with (A4) replaced by
the stronger condition (B1)

$$ \sqrt{n}\,(\hat A_j^{(n)} - A_j^{\{n\}}) \xrightarrow{fd} m_j. $$

Corollary 2.4. Under the conditions of Theorem 2.1 but with (S1) replaced by
the stronger condition (S2)

$$ \sqrt{n}\,(\hat A_j^{(n)} - A_j) \xrightarrow{fd} m_j. $$


If $X$ is assumed to be a continuous process (equivalently the martingale $M$ is
continuous) then it is possible to strengthen these results to weak convergence
in the function space $C[0,1]$.

Theorem 2.5. Suppose that (A1)-(A4), (S1) and (B2) hold, $d_n \to \infty$, $d_n = o(n^{1/2})$
and $M$ is a continuous martingale. Then

$$ \sqrt{n}\,(\hat A_j^{(n)} - A_j^{(n)}) \xrightarrow{\mathcal{D}} m_j \quad \text{in } C[0,1]. $$

Corollary 2.6. Under the conditions of Theorem 2.5 but with (A4) replaced by
the stronger condition (B1)

$$ \sqrt{n}\,(\hat A_j^{(n)} - A_j^{\{n\}}) \xrightarrow{\mathcal{D}} m_j \quad \text{in } C[0,1]. $$

Corollary 2.7. Under the conditions of Theorem 2.5 but with (S1) replaced by
the stronger condition (S2)

$$ \sqrt{n}\,(\hat A_j^{(n)} - A_j) \xrightarrow{\mathcal{D}} m_j \quad \text{in } C[0,1]. $$

Under appropriate smoothness conditions on $\alpha_j$, condition (S1) or (S2) can be
satisfied by a careful choice of the sieve and $(d_n)$. We mention two important
sieves for which this is possible.

(1) The Fourier sieve. Take $\phi_{jr} = \phi_r$, $j = 1, \ldots, p$, where $\phi_1(t) \equiv 1$ and for $r \ge 2$

$$ \phi_r(t) = \begin{cases} \sqrt{2} \cos(\pi r t), & r \text{ even} \\ \sqrt{2} \sin(\pi (r-1) t), & r \text{ odd.} \end{cases} \tag{2.7} $$

Then (S1) is satisfied under the following conditions:

(F2) The extension of $\alpha_j$ to a function of period 1 on $(-\infty, \infty)$ is Lipschitz
of order $\gamma > \frac{1}{2}$.

The stronger assumption (S2) is satisfied provided (F1) and the following
conditions hold:

(F3) The extension of $\alpha_j$ to a function of period 1 on $(-\infty, \infty)$ has Lipschitz
first derivative;

(F4) $d_n^4 / n \to \infty$.

These  facts  are  consequences  of  a  result  in  approximation  theory  known  as 
Jackson's  inequality,  see  Lemma  4.4. 
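The role of the Fourier sieve in (S1) and (S2) can be checked numerically. The sketch below is illustrative only: it assumes the odd-indexed functions are $\sqrt{2}\sin(\pi(r-1)t)$ so that the system is complete, the helper names `phi`, `integrate` and `projection_error` are hypothetical, integrals are approximated by the midpoint rule, and the target `alpha` is an arbitrary smooth function of period 1 chosen for the example. It verifies orthonormality of two basis functions and computes the squared $L^2$ projection error appearing in (S1) and (S2) for a few sieve dimensions.

```python
import math

def phi(r, t):
    """Fourier sieve function phi_r on [0, 1], r = 1, 2, ..."""
    if r == 1:
        return 1.0
    if r % 2 == 0:
        return math.sqrt(2.0) * math.cos(math.pi * r * t)
    return math.sqrt(2.0) * math.sin(math.pi * (r - 1) * t)

def integrate(f, m=2000):
    """Midpoint rule on [0, 1] with m cells."""
    return sum(f((i + 0.5) / m) for i in range(m)) / m

def projection_error(f, d):
    """Squared L2 distance between f and its projection on span{phi_1, ..., phi_d}."""
    coefs = [integrate(lambda t, r=r: f(t) * phi(r, t)) for r in range(1, d + 1)]
    return integrate(lambda t: f(t) ** 2) - sum(c * c for c in coefs)

def alpha(t):
    # hypothetical periodic target with Lipschitz first derivative
    return math.exp(math.cos(2.0 * math.pi * t))

gram_23 = integrate(lambda t: phi(2, t) * phi(3, t))   # off-diagonal Gram entry
gram_22 = integrate(lambda t: phi(2, t) ** 2)          # diagonal Gram entry
errors = {d: projection_error(alpha, d) for d in (3, 5, 9)}
```

For a target this smooth the error decays very quickly in $d$; for a function that is merely Lipschitz of order $\gamma$ the decay is slower, which is why (F2) and (F4) constrain the growth of $d_n$.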

(2) The Walsh sieve. The following definition of the Walsh functions is due
to Paley (1932); other definitions can be found in Beauchamp (1975). First
define the Rademacher functions $(\psi_r, r \ge 0)$ on $[0,1)$ by

$$ \psi_r(t) = (-1)^k \quad \text{if } k 2^{-(r+1)} \le t < (k+1) 2^{-(r+1)}. $$

Then put $\phi_0(t) \equiv 1$ and for $r = 2^{r_1} + 2^{r_2} + \cdots + 2^{r_u}$, with $r_1 > r_2 > \cdots > r_u \ge 0$, let

$$ \phi_r(t) = \psi_{r_1}(t) \psi_{r_2}(t) \cdots \psi_{r_u}(t) \quad \text{for } t \in [0,1). $$

$\phi_r$ is called the $r$th Walsh function in Paley ordering. Results of Fine (1955,
p. 394) show that the Walsh sieve, defined by taking $\phi_{jr}$ as the $r$th Walsh function,
satisfies assumption (S1) provided $\alpha_j$ is Lipschitz of order $\gamma > \frac{1}{2}$. Unfortunately,
in general (S2) is not satisfied for the Walsh sieve unless the rate of increase
of $d_n$ is prohibitively large. However some examples discussed in Beauchamp
(1975, p. 34) show that the Walsh sieve would be preferable to the Fourier sieve
if the $\alpha_j$'s have a rectangular form. Also the Walsh sieve has an advantage over
the Fourier sieve in terms of computational simplicity.
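The computational simplicity of the Walsh sieve is easy to see in code. The following sketch is an illustration under assumptions rather than a definitive implementation: the function names `rademacher`, `walsh` and `inner` are hypothetical, the index 0 is used for the constant function as in Paley's ordering, and inner products are computed on a dyadic grid, which is exact for Walsh functions of index below $2^{\text{levels}}$ because they are constant on each grid cell.

```python
def rademacher(r, t):
    """psi_r(t) = (-1)^k for k 2^-(r+1) <= t < (k+1) 2^-(r+1), t in [0, 1)."""
    k = int(t * 2 ** (r + 1))
    return -1 if k % 2 else 1

def walsh(r, t):
    """Walsh function of index r >= 0 in Paley ordering (index 0 is constant)."""
    value, bit = 1, 0
    while r:
        if r & 1:
            # the binary digit at position `bit` contributes psi_bit(t)
            value *= rademacher(bit, t)
        r >>= 1
        bit += 1
    return value

def inner(r, s, levels=4):
    """L2[0, 1) inner product of two Walsh functions with index < 2**levels."""
    m = 2 ** levels
    return sum(walsh(r, (i + 0.5) / m) * walsh(s, (i + 0.5) / m) for i in range(m)) / m
```

Only sign flips and additions are needed, in contrast with the trigonometric evaluations required by the Fourier sieve.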

In order to use the results of this section to obtain confidence intervals
and confidence bands for $A_j(t)$ it is first necessary to estimate the function
$G_j(t) = E m_j^2(t)$. The matrix $K(t)$ given in (2.5) can be estimated by
$\hat K^{(n)}(t) = (\hat K_{jk}^{(n)}(t))$, where $\hat K_{jk}^{(n)}(t) = n^{-1} \sum_{i=1}^{n} Y_{ij}(t) Y_{ik}(t)$.
Assuming that the covariate processes have paths in $D[0,1]$ and

$$ E \sup_{t \in [0,1]} |Y_j(t) Y_k(t)| < \infty $$

it follows from Ranga Rao (1963, Theorem 1) that

$$ \sup_{t \in [0,1]} |\hat K_{jk}^{(n)}(t) - K_{jk}(t)| \xrightarrow{a.s.} 0. \tag{2.8} $$

If $M$ is a standard Wiener process $G_j(t)$ can be estimated by

$$ \hat G_j^{(n)}(t) = \int_0^t (\hat L_{jj}^{(n)}(s))^2 \hat K_{jj}^{(n)}(s)\,ds, $$

where $\hat L^{(n)}(s) = (\hat L_{jk}^{(n)}(s))$ is a generalized inverse of $\hat K^{(n)}(s)$. Then from (2.8)

$$ \sup_{t \in [0,1]} |\hat G_j^{(n)}(t) - G_j(t)| \xrightarrow{a.s.} 0. \tag{2.9} $$
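A minimal sketch of these two estimators for the Wiener case with $p = 2$ is given below. It is an illustration only: the function names `k_hat`, `inverse_2x2` and `g_hat` are hypothetical, the covariates are assumed to be recorded on $m$ equal grid cells as `Y[i][s][j]`, the integral is replaced by a Riemann sum, and the empirical matrix is assumed to be nonsingular so that an ordinary inverse can be used in place of a generalized inverse.

```python
def k_hat(Y, s):
    """Empirical matrix n^-1 sum_i Y_ij(s) Y_ik(s) at grid cell s, for p = 2."""
    n = len(Y)
    return [[sum(Y[i][s][j] * Y[i][s][k] for i in range(n)) / n
             for k in range(2)] for j in range(2)]

def inverse_2x2(k):
    """Ordinary inverse of a nonsingular 2 x 2 matrix."""
    det = k[0][0] * k[1][1] - k[0][1] * k[1][0]
    return [[k[1][1] / det, -k[0][1] / det],
            [-k[1][0] / det, k[0][0] / det]]

def g_hat(Y, j):
    """Riemann-sum path of int_0^t (L_jj)^2 K_jj ds evaluated at the grid points."""
    m = len(Y[0])
    path, total = [], 0.0
    for s in range(m):
        k = k_hat(Y, s)
        l = inverse_2x2(k)
        total += l[j][j] ** 2 * k[j][j] / m
        path.append(total)
    return path

# Toy covariates: Y_i1 = 1 and Y_i2 = +1 or -1 alternating, so K(t) is the identity.
Y = [[[1.0, 1.0 if i % 2 == 0 else -1.0] for _ in range(50)] for i in range(4)]
path = g_hat(Y, 0)
```

In this toy example $K(t)$ is the identity matrix, so the estimated variance function is simply $t$.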


If $X$ is a point process the predictable quadratic variation process $\langle M \rangle_t$ is
$\int_0^t \lambda(s)\,ds$ which involves $\alpha_1, \ldots, \alpha_p$. In turn $G_j(t)$ involves the unknown
$\alpha_1, \ldots, \alpha_p$. However, as we have seen, $\hat\alpha_j^{(n)}$ is a consistent estimator of $\alpha_j$ in
$L^2$-norm, so it is possible to consistently estimate $G_j(t)$ in this case as well.
It follows from Corollary 2.4 that upper and lower limits of a $100(1 - \alpha)$ percent
confidence interval for $A_j(t)$ are given by $\hat A_j^{(n)}(t) \pm z_{\alpha/2}\, n^{-1/2}\, \hat G_j^{(n)}(t)^{1/2}$, where $z_{\alpha/2}$ is
the upper $\alpha/2$ quantile of the standard normal distribution.
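The pointwise interval translates directly into code. The sketch below assumes that estimates of the integrated function and of its asymptotic variance at a fixed time point are already available as plain numbers; the function name `pointwise_interval` and the example values are hypothetical.

```python
import math
from statistics import NormalDist

def pointwise_interval(a_hat, g_hat, n, level=0.95):
    """Interval a_hat +/- z * sqrt(g_hat / n) at a single time point."""
    z = NormalDist().inv_cdf(1.0 - (1.0 - level) / 2.0)  # upper (1 - level)/2 quantile
    half = z * math.sqrt(g_hat / n)
    return a_hat - half, a_hat + half

# Hypothetical inputs: estimate 0.8, variance estimate 0.25, n = 100 replicates.
lower, upper = pointwise_interval(0.8, 0.25, 100)
```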

If $M$ is a standard Wiener process we can give a confidence band for $A_j$.
First note that $m_j(t)\, G_j(1)^{1/2} (G_j(1) + G_j(t))^{-1}$ is distributed as

$$ W^0\!\left( \frac{G_j(t)}{G_j(1) + G_j(t)} \right), $$

where $W^0$ is the Brownian bridge on $[0,1]$. It follows from (2.9) and Corollary
2.7 that upper and lower limits of a $100(1 - \alpha)$ percent confidence band for $A_j$
are given by

$$ \hat A_j^{(n)}(t) \pm c_\alpha\, n^{-1/2}\, \hat G_j^{(n)}(1)^{1/2} \left( 1 + \frac{\hat G_j^{(n)}(t)}{\hat G_j^{(n)}(1)} \right), \quad t \in [0,1], $$

where $c_\alpha$ is the upper $\alpha$ quantile of the distribution of $\sup_{t \in [0,1]} |W^0(t)|$. A
table for this distribution can be found in Hall and Wellner (1980).
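In place of a table, the critical value can be computed numerically. The sketch below is a hedged illustration: it assumes the supremum of the Brownian bridge is taken over the whole interval $[0,1]$, in which case its distribution is the classical Kolmogorov distribution, and it assumes the band half-width is $c\, n^{-1/2} G(1)^{1/2} (1 + G(t)/G(1))$, the form obtained by inverting the Brownian bridge identity for $m_j$. Because the transformed time $G(t)/(G(1)+G(t))$ never exceeds $\frac{1}{2}$, a supremum over all of $[0,1]$ can only make the band wider. The function names are hypothetical.

```python
import math

def kolmogorov_cdf(x, terms=100):
    """P(sup_{0<=t<=1} |W0(t)| <= x) for a Brownian bridge W0 (alternating series)."""
    if x <= 0.0:
        return 0.0
    return 1.0 - 2.0 * sum((-1) ** (k - 1) * math.exp(-2.0 * k * k * x * x)
                           for k in range(1, terms + 1))

def sup_bridge_quantile(level):
    """Value c with P(sup |W0| <= c) = level, found by bisection."""
    lo, hi = 0.2, 5.0
    for _ in range(80):
        mid = 0.5 * (lo + hi)
        if kolmogorov_cdf(mid) < level:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def confidence_band(a_hat_path, g_hat_path, n, level=0.95):
    """Band limits on a grid, given paths of the two estimates on the same grid."""
    c = sup_bridge_quantile(level)
    g1 = g_hat_path[-1]
    band = []
    for a, g in zip(a_hat_path, g_hat_path):
        half = c * math.sqrt(g1 / n) * (1.0 + g / g1)
        band.append((a - half, a + half))
    return band

c95 = sup_bridge_quantile(0.95)
band = confidence_band([0.0, 0.5, 1.0], [0.1, 0.5, 1.0], n=100)
```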


The assumptions (A4) and (B1) can be weakened by assuming that the sequence
of eigenvalues of the matrices $R^{(n)}$, $n \ge 1$, defined in Lemma 4.3, is bounded away
from 0. However in practice it probably would not be necessary to go to the
trouble of checking this given a lack of any obvious collinearity in the covariates.


3. Smoothing the sieve estimators.


Ramlau-Hansen (1983) has used the methods of kernel density estimation to
smooth the Nelson-Aalen estimator and obtain a pointwise consistent estimator of
the hazard function. We now apply kernel function smoothing methods to the sieve
estimators. Consider the smoothed sieve estimator $\tilde\alpha_j^{(n)}$ defined by (1.4) where,
for simplicity, we assume that the kernel function $K$ has support $[-1,1]$. The
following result shows how the sieve dimension $d_n$ and the bandwidth parameter
$b_n$ can be specified so that $\tilde\alpha_j^{(n)}$ is a pointwise consistent estimator of $\alpha_j$.

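Theorem 3.1. Suppose that the Fourier sieve is used, conditions (A2), (A3), (A5),
(B1), (F1) and (F3) hold and $K$ is differentiable with Lipschitz derivative. Let

$$ d_n = [n^{\delta}], \quad \text{where } \tfrac{5}{12} < \delta < \tfrac{1}{2}, $$

$$ b_n = n^{-\beta}, \quad \text{where } \tfrac{1}{3} < \beta < \tfrac{4}{5}\delta. $$

Then for each $t \in (0,1)$

$$ \tilde\alpha_j^{(n)}(t) - \alpha_j(t) = O_p(n^{-\frac{1}{2}(1-\beta)}). \tag{3.1} $$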
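The smoothing step can be sketched numerically as follows. This is an illustration under assumptions, not a prescribed implementation: the biweight kernel is one example of a kernel with support $[-1,1]$, integral 1 and a Lipschitz derivative; the sieve estimate is assumed to be available as a list of values on $m$ equal grid cells; the integral of the kernel against the estimate is replaced by a midpoint sum; and the exponents 0.45 and 0.35 are merely example values of $\delta$ and $\beta$. The function names are hypothetical.

```python
def biweight(u):
    """Biweight kernel: support [-1, 1], integral 1, Lipschitz derivative."""
    return 15.0 / 16.0 * (1.0 - u * u) ** 2 if abs(u) <= 1.0 else 0.0

def sieve_dimension(n, delta):
    """d_n = [n^delta], the integer part of n^delta."""
    return int(n ** delta)

def bandwidth(n, beta):
    """b_n = n^(-beta)."""
    return n ** (-beta)

def smooth(alpha_hat, t, b):
    """(1/b) * int_0^1 K((t - s)/b) alpha_hat(s) ds by the midpoint rule."""
    m = len(alpha_hat)
    return sum(biweight((t - (s + 0.5) / m) / b) * alpha_hat[s]
               for s in range(m)) / (m * b)

n = 1000
d_n = sieve_dimension(n, 0.45)
b_n = bandwidth(n, 0.35)
# Smoothing a constant function at an interior point should return the constant.
value = smooth([1.0] * 2000, 0.5, b_n)
```

Near the endpoints 0 and 1 part of the kernel mass falls outside $[0,1]$, which is one reason the pointwise result is stated for interior points only.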

If it is assumed that $X$ is a continuous process then we can establish uniform
consistency for $\tilde\alpha_j^{(n)}$. The rate of convergence in uniform metric is naturally
slower than the pointwise rate given by (3.1).

Theorem 3.2. Suppose that (A1)-(A4), (S2) and (B2) hold, $\alpha_j$ and $K$ are Lipschitz,
$d_n \to \infty$, $d_n = o(n^{1/2})$, $b_n = n^{-\beta}$ where $\frac{1}{4} < \beta < \frac{1}{2}$, and $M$ is a continuous martingale.
Then

$$ \sup_{t \in [0,1]} |\tilde\alpha_j^{(n)}(t) - \alpha_j(t)| = O_p(n^{-\frac{1}{2}(1-2\beta)}). $$

4.  Proofs  of  Theorems. 


The following notation was used in McKeague (1986) where a more detailed
discussion can be found.

Notation

$(\Psi_{jr}, r \ge 1)$ denotes a complete orthonormal sequence in $L^2([0,1], d\mu_j)$ such
that $\mathrm{span}\{\Psi_{jr}, r = 1, \ldots, d_n\} = \mathrm{span}\{\phi_{jr}, r = 1, \ldots, d_n\}$ for all $n \ge 1$.

$\hat\xi_{jr}^{(n)}$, $\xi_{jr}^{(n)}$ denote the coordinates of $\hat\alpha_j^{(n)}$ and $\alpha_j^{(n)}$ with respect to the basis
$(\Psi_{jr}, r \ge 1)$ in $L^2([0,1], d\mu_j)$, respectively.

$a^{(n)}$ denotes the $pd_n \times pd_n$ matrix partitioned into the $p^2$ submatrices
$a_{jk}^{(n)} = (a_{jkr\ell}^{(n)};\ r, \ell = 1, \ldots, d_n)$.

$$ c_{jr}^{(n)} = n^{-1} \sum_{k=1}^{p} \sum_{i=1}^{n} \int_0^1 \Psi_{jr}(t) Y_{ij}(t) Y_{ik}(t) \{\alpha_k(t) - \alpha_k^{(n)}(t)\}\,dt + n^{-1} \sum_{i=1}^{n} \int_0^1 \Psi_{jr}(t) Y_{ij}(t)\,dM_i(t) $$

$$ \rho_{jr}^{(n)} = n^{-1} \sum_{i=1}^{n} \int_0^1 \Psi_{jr}(t) Y_{ij}(t)\,dM_i(t) $$

$$ \zeta_{jkr\ell}^{(n)} = \int_0^1 \Psi_{jr}(t) \Psi_{k\ell}(t) E[Y_j(t) Y_k(t)]\,dt $$

$\zeta^{(n)}$ denotes the $pd_n \times pd_n$ matrix partitioned into the $p^2$ submatrices
$\zeta_{jk}^{(n)} = (\zeta_{jkr\ell}^{(n)};\ r, \ell = 1, \ldots, d_n)$.

$\hat\xi^{(n)}$, $\xi^{(n)}$, $c^{(n)}$, $\rho^{(n)}$ are $p \times d_n$ matrices defined by their entries given
above.


It is easily checked that

$$ \mathrm{vec}(\hat\xi^{(n)} - \xi^{(n)}) = a^{(n)-1}\, \mathrm{vec}\, c^{(n)}, \tag{4.1} $$

where $a^{(n)-1}$ is a generalized inverse of $a^{(n)}$. The next two lemmas collect
various facts proved in McKeague (1986).

Lemma 4.1. Suppose that (A1)-(A3), (A5) hold. Then

(i) $E \|a^{(n)} - \zeta^{(n)}\|^2 = O(d_n/n)$, where $\|\cdot\|$ denotes operator norm;

(ii) $E \|\mathrm{vec}\, \rho^{(n)}\|^2 = O(d_n/n)$;

(iii) $E \|\mathrm{vec}(c^{(n)} - \rho^{(n)})\|^2 = O\!\left( \sum_{j=1}^{p} \int_0^1 [\alpha_j(t) - \alpha_j^{(n)}(t)]^2\,dt \right) + o(1/n)$.

Lemma 4.2. (i) (A2)-(A4) imply that $\zeta^{(n)}$ is invertible for all $n \ge 1$ and
$\sup_{n \ge 1} \|\zeta^{(n)-1}\| < \infty$.

(ii) (A1)-(A5) imply that $P(a^{(n)} \text{ is invertible}) \to 1$ and $\{\|a^{(n)-1}\|, n \ge 1\}$ is a tight
sequence of random variables.

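Proof of Theorem 2.1. Writing $\hat\alpha_j^{(n)}$ and $\alpha_j^{(n)}$ in terms of the basis $(\Psi_{jr}, r \ge 1)$
we obtain

$$ \sqrt{n} \sum_{j=1}^{p} \int_0^1 h_j(t) [\hat\alpha_j^{(n)}(t) - \alpha_j^{(n)}(t)]\,dt = \sqrt{n} \sum_{j=1}^{p} \sum_{r=1}^{d_n} h_{jr} (\hat\xi_{jr}^{(n)} - \xi_{jr}^{(n)}) \tag{4.2} $$

where $h_{jr} = \int_0^1 h_j(t) \Psi_{jr}(t)\,dt$. By Lemma 4.2 we may define a $p \times d_n$ matrix
$\lambda^{(n)} = (\lambda_{jr}^{(n)})$ satisfying

$$ \mathrm{vec}(\lambda^{(n)}) = \zeta^{(n)-1}\, \mathrm{vec}(h^{(n)}), \tag{4.3} $$

where $h^{(n)} = (h_{jr})$. Let $I^{(n)}$ denote the $pd_n \times pd_n$ identity matrix. Then using
(4.1) we can split (4.2) into four parts

$$ \sqrt{n}\, \mathrm{vec}(\lambda^{(n)})'\, \mathrm{vec}\, \rho^{(n)} \tag{4.4} $$

$$ +\ \sqrt{n}\, \mathrm{vec}(\lambda^{(n)})'\, \mathrm{vec}(c^{(n)} - \rho^{(n)}) \tag{4.5} $$

$$ +\ \sqrt{n}\, \mathrm{vec}(\lambda^{(n)})'\, (a^{(n)} a^{(n)-1} - I^{(n)})\, \mathrm{vec}\, c^{(n)} \tag{4.6} $$

$$ +\ \sqrt{n}\, \mathrm{vec}(\lambda^{(n)})'\, (\zeta^{(n)} - a^{(n)})\, a^{(n)-1}\, \mathrm{vec}\, c^{(n)}. \tag{4.7} $$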

By Lemma 4.2(ii), (4.6) converges to zero in probability. (4.7) is bounded in
absolute value by

$$ \|\mathrm{vec}\, \lambda^{(n)}\| \left[ (n/d_n)^{1/2} \|\zeta^{(n)} - a^{(n)}\| \right] \|a^{(n)-1}\| \left[ d_n^{1/2} \|\mathrm{vec}\, c^{(n)}\| \right] \tag{4.8} $$

and this tends to zero in probability since

$$ \sup_{n \ge 1} \|\mathrm{vec}\, \lambda^{(n)}\| \le \left[ \sum_{j=1}^{p} \int_0^1 \frac{h_j^2(t)}{EY_j^2(t)}\,dt \right]^{1/2} \sup_{n \ge 1} \|\zeta^{(n)-1}\| < \infty $$

by Lemma 4.2(i), $\{\|a^{(n)-1}\|, n \ge 1\}$ is tight and $d_n \|\mathrm{vec}\, c^{(n)}\|^2 \xrightarrow{P} 0$ by
Lemma 4.1 (ii), (iii), (S1) and the assumption $d_n = o(n^{1/2})$. Similarly (4.5)
converges to zero in probability. It remains to consider (4.4) which we write
in the form

$$ \frac{1}{\sqrt{n}} \sum_{i=1}^{n} Z_{ni} \tag{4.9} $$

where

$$ Z_{ni} = \sum_{j=1}^{p} \int_0^1 u_j^{(n)}(t) Y_{ij}(t)\,dM_i(t) \tag{4.10} $$

$$ u_j^{(n)}(t) = \sum_{r=1}^{d_n} \lambda_{jr}^{(n)} \Psi_{jr}(t). \tag{4.11} $$

Introduce

$$ Z_{\infty i} = \sum_{j=1}^{p} \int_0^1 H_j(t) Y_{ij}(t)\,dM_i(t) \tag{4.12} $$

where $H_j$ is defined by (2.4). In an Appendix (Lemma 4.6) we have used some
operator theory to show that $u_j^{(n)} \to H_j$ in $L^2[0,1]$ as $n \to \infty$. Consequently
$Z_{ni} \to Z_{\infty i}$ in $L^2(\Omega, F, P)$ as $n \to \infty$:

$$ E(Z_{ni} - Z_{\infty i})^2 \le p \sum_{j=1}^{p} E\left\{ \int_0^1 (u_j^{(n)}(t) - H_j(t)) Y_j(t)\,dM(t) \right\}^2 $$

$$ = p \sum_{j=1}^{p} \int_0^1 (u_j^{(n)}(t) - H_j(t))^2\,dv_j(t) $$

$\to 0$, by (A5). Now apply the Lindeberg-Feller Theorem to (4.9). Note that for
$\epsilon > 0$,

$$ \frac{1}{n} \sum_{i=1}^{n} E\left( Z_{ni}^2\, I(|Z_{ni}| > \epsilon \sqrt{n}) \right) = E\left( Z_{n1}^2\, I(Z_{n1}^2 > \epsilon^2 n) \right) $$

$\to 0$, since $\{Z_{n1}^2, n \ge 1\}$ is uniformly integrable, so the Lindeberg condition is
satisfied. Thus (4.9) converges in distribution to $N(0, \sigma^2)$, where
$\sigma^2 = E Z_{\infty 1}^2 = \sum_{j=1}^{p} \sum_{k=1}^{p} \sigma_{jk}$. An application of the Cramér-Wold device completes the
proof of the theorem. □

Of the corollaries to Theorem 2.1 only Corollary 2.3 needs some explanation.
The following lemma can be used to rework the proof of Theorem 2.1 in terms of
the basis $(\phi_{jr}, r \ge 1)$ instead of $(\Psi_{jr}, r \ge 1)$ to yield Corollary 2.3. Note that
we have replaced (A4) by the stronger condition (B1) in order to do this.

Lemma 4.3. Suppose that (A2), (A3), (B1) hold and let $R^{(n)}$ denote the $pd_n \times pd_n$
matrix partitioned into the $p^2$ submatrices $R_{jk}^{(n)}$, $j, k = 1, \ldots, p$, with entries

$$ R_{jkr\ell}^{(n)} = \int_0^1 \phi_{jr}(t) \phi_{k\ell}(t) E[Y_j(t) Y_k(t)]\,dt. \tag{4.13} $$

Then $R^{(n)}$ is invertible for all $n \ge 1$ and $\sup_{n \ge 1} \|R^{(n)-1}\| < \infty$.

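Proof. Let $c_j = \inf_{t \in [0,1]} EY_j^2(t)$, $\bar Y_j(t) = c_j^{-1/2} Y_j(t)$ for $j = 1, \ldots, p$ and let $\bar R^{(n)}$
be a $pd_n \times pd_n$ matrix partitioned the same way as $R^{(n)}$ but with entries

$$ \bar R_{jkr\ell}^{(n)} = \int_0^1 \phi_{jr}(t) \phi_{k\ell}(t) E[\bar Y_j(t) \bar Y_k(t)]\,dt. $$

It suffices to show that $\bar R^{(n)}$ is invertible for all $n \ge 1$ and $\sup_{n \ge 1} \|\bar R^{(n)-1}\| < \infty$.
For then the matrix with entries $D_{jkr\ell}^{(n)} = (c_j c_k)^{-1/2} (\bar R^{(n)-1})_{jkr\ell}$ is an inverse for
$R^{(n)}$ and

$$ \sup_{n \ge 1} \|R^{(n)-1}\| \le \left( \max_{j=1,\ldots,p} c_j^{-1} \right) \sup_{n \ge 1} \|\bar R^{(n)-1}\| < \infty. $$

Condition (B1) implies that

$$ \sup_{t \in [0,1]} |E[\bar Y_j(t) \bar Y_k(t)]| < \frac{1}{p-1} \quad \text{for } j \ne k. \tag{4.14} $$

Also note that

$$ \inf_{t \in [0,1]} E\bar Y_j^2(t) = 1 \quad \text{for } j = 1, \ldots, p. $$

Let $b_j = \sup_{t \in [0,1]} E\bar Y_j^2(t)$, $f_j(t) = 1 - b_j^{-1} E\bar Y_j^2(t)$ and $I_n$ denote the $d_n \times d_n$ identity matrix.
Then

$$ \|I_n - b_j^{-1} \bar R_{jj}^{(n)}\|^2 \le \sup_{h \in L^2[0,1],\ \|h\| \le 1}\ \sum_{r=1}^{d_n} \left( \int_0^1 h(t) \phi_{jr}(t) f_j(t)\,dt \right)^2 $$

$$ \le \sup_{\|h\| \le 1} \int_0^1 h^2(t) f_j^2(t)\,dt \quad \text{(by Bessel's inequality)} $$

$$ \le \sup_{t \in [0,1]} f_j^2(t) < 1. $$

It follows that $\bar R_{jj}^{(n)}$ is invertible and

$$ \|(b_j^{-1} \bar R_{jj}^{(n)})^{-1}\| \le \frac{1}{1 - \sup_{t \in [0,1]} f_j(t)} = \frac{b_j}{\inf_{t \in [0,1]} E\bar Y_j^2(t)} = b_j. $$

Let $F^{(n)}$ denote the $pd_n \times pd_n$ matrix obtained by replacing all off diagonal
submatrices of $\bar R^{(n)}$ by zero matrices. Let $G^{(n)} = \bar R^{(n)} - F^{(n)}$ and $I^{(n)}$ denote the
$pd_n \times pd_n$ identity matrix. Then $F^{(n)}$ is invertible and

$$ \sup_{n \ge 1} \|F^{(n)-1}\| \le 1. \tag{4.15} $$

Using (4.14) and an argument from McKeague (1986, 4.10)

$$ \sup_{n \ge 1} \|G^{(n)}\| < 1, $$

so that

$$ \sup_{n \ge 1} \|I^{(n)} - \bar R^{(n)} F^{(n)-1}\| \le \sup_{n \ge 1} \left( \|G^{(n)}\|\, \|F^{(n)-1}\| \right) < 1. $$

Thus $\bar R^{(n)} F^{(n)-1}$ is invertible for all $n \ge 1$,

$$ \sup_{n \ge 1} \|(\bar R^{(n)} F^{(n)-1})^{-1}\| < \infty, $$

and using (4.15) we conclude that $\bar R^{(n)}$ is invertible for all $n \ge 1$ and

$$ \sup_{n \ge 1} \|\bar R^{(n)-1}\| < \infty. \quad \square $$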


Proof of Theorem 2.5. Condition (B2) implies that (A5) holds, so that the
conditions of Corollary 2.2 are satisfied and the finite dimensional distributions
of $\sqrt{n}\,(\hat A_j^{(n)} - A_j^{(n)})$ converge. From the proof of Theorem 3.1 it can be seen that

$$ \sqrt{n}\,(\hat A_j^{(n)}(t) - A_j^{(n)}(t)) = U_n(t) + V_n(t) \tag{4.16} $$

where

$$ \sup_{t \in [0,1]} |U_n(t)| \xrightarrow{P} 0, \tag{4.17} $$

$$ V_n(t) = \frac{1}{\sqrt{n}} \sum_{i=1}^{n} Z_{ni}(t), $$

$$ Z_{ni}(t) = \sum_{k=1}^{p} \int_0^1 u_k^{(n)}(s,t) Y_{ik}(s)\,dM_i(s), $$

$$ u_k^{(n)}(s,t) = \sum_{r=1}^{d_n} \lambda_{kr}^{(n)}(t) \Psi_{kr}(s), $$

$$ \mathrm{vec}[\lambda^{(n)}(t)] = \zeta^{(n)-1}\, \mathrm{vec}[h^{(n)}(t)], $$

$$ h_{kr}^{(n)}(t) = \begin{cases} \int_0^1 1_{[0,t]}(s) \Psi_{jr}(s)\,ds & \text{for } k = j,\ r = 1, \ldots, d_n \\ 0 & \text{for } k \ne j,\ r = 1, \ldots, d_n. \end{cases} $$

It remains to show that the sequence of processes $\{V_n, n \ge 1\}$ is tight in $C[0,1]$.
In what follows $C$ denotes a positive constant independent of $n$ whose value may
change at each occurrence. From Billingsley (1968, Theorem 12.3) the required
tightness is implied by the following condition: there exist constants $q > 0$,
$\gamma > 1$ such that for all $n \ge 1$, $t_1, t_2 \in [0,1]$

$$ E|V_n(t_2) - V_n(t_1)|^q \le C |t_2 - t_1|^{\gamma}. \tag{4.18} $$

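The processes $Z_{n1}, \ldots, Z_{nn}$ are independent and have zero mean so that by the
Marcinkiewicz-Zygmund inequality (see Chow and Teicher, 1978, p. 356) for $q \ge 1$

$$ E|V_n(t_2) - V_n(t_1)|^q \le C\, E\left\{ n^{-1} \sum_{i=1}^{n} [Z_{ni}(t_2) - Z_{ni}(t_1)]^2 \right\}^{q/2} $$

$$ \le C\, E|Z_{n1}(t_2) - Z_{n1}(t_1)|^q $$

$$ \le C \sum_{k=1}^{p} E\left| \int_0^1 \{u_k^{(n)}(s,t_2) - u_k^{(n)}(s,t_1)\} Y_k(s)\,dM_s \right|^q. \tag{4.19} $$

For fixed $t_1$, $t_2$, $k$, $n$ define the process

$$ N_t = \int_0^t \{u_k^{(n)}(s,t_2) - u_k^{(n)}(s,t_1)\} Y_k(s)\,dM_s, \quad t \in [0,1], $$

which is a square integrable martingale. By the Burkholder-Davis-Gundy inequality
(see Dellacherie and Meyer, 1982, p. 287), for $q \ge 1$

$$ E\left( \sup_{t \in [0,1]} |N_t| \right)^q \le C\, E[N]_1^{q/2}. \tag{4.20} $$

Also, by Dellacherie and Meyer (1982, Theorem VIII.30) and the assumption that
$M$ is a continuous martingale,

$$ [N]_t = \int_0^t \{u_k^{(n)}(s,t_2) - u_k^{(n)}(s,t_1)\}^2 Y_k^2(s)\,d\langle M \rangle_s, \tag{4.21} $$

so that combining (4.19), (4.20) and (4.21) with $q = 4$

$$ E|V_n(t_2) - V_n(t_1)|^4 \le C \sum_{k=1}^{p} E\left\{ \int_0^1 [u_k^{(n)}(s,t_2) - u_k^{(n)}(s,t_1)]^2 Y_k^2(s)\,d\langle M \rangle_s \right\}^2 $$

$$ \le C \sum_{k=1}^{p} \left\{ \int_0^1 [u_k^{(n)}(s,t_2) - u_k^{(n)}(s,t_1)]^2\,ds \right\}^2 \quad \text{(by Lemma 4.5 of McKeague (1986) and (B2))} $$

$$ \le C \left\{ \sum_{k=1}^{p} \sum_{r=1}^{d_n} [\lambda_{kr}^{(n)}(t_2) - \lambda_{kr}^{(n)}(t_1)]^2 \right\}^2 $$

$$ = C\, \|\mathrm{vec}[\lambda^{(n)}(t_2) - \lambda^{(n)}(t_1)]\|^4 $$

$$ \le C\, \|\mathrm{vec}[h^{(n)}(t_2) - h^{(n)}(t_1)]\|^4 \quad \text{(by Lemma 4.2(i))} $$

$$ \le C \left( \int_{t_1}^{t_2} \frac{1}{EY_j^2(s)}\,ds \right)^2 \quad \text{(by Bessel's inequality)} $$

$$ \le C |t_2 - t_1|^2. \quad \text{(by (A3))} $$

Thus (4.18) is satisfied for $q = 4$ and $\gamma = 2$. □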

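The proof of Theorem 3.1 depends on a result from approximation theory
known as Jackson's inequality. The following $L^2$-version of this result (see
Cernyh, 1969) is suitable for our purposes.

Lemma 4.4. Let $f: \mathbb{R} \to \mathbb{R}$ have period 1 and denote the partial sums of the
Fourier series of $f$ by $f^{\{d\}}(t) = \sum_{r=1}^{d} f_r \phi_r(t)$, $d \ge 1$, where $\phi_r$ is defined by (2.7)
and $f_r = \int_0^1 f(t) \phi_r(t)\,dt$. There exist universal constants $C_1$, $C_2$ such that

(i) for all $d \ge 1$

$$ \int_0^1 [f(t) - f^{\{d\}}(t)]^2\,dt \le C_1 \sup_{|t-s| < 1/d} |f(t) - f(s)|^2; $$

(ii) if $f$ is differentiable then for all $d \ge 1$

$$ \int_0^1 [f(t) - f^{\{d\}}(t)]^2\,dt \le \frac{C_2}{d^2} \sup_{|t-s| < 1/d} |f'(t) - f'(s)|^2. $$

Proof of Theorem 3.1. Introduce

$$ \bar\alpha_j^{(n)}(t) = \frac{1}{b_n} \int_0^1 K\!\left( \frac{t-s}{b_n} \right) \alpha_j(s)\,ds $$

and note that

$$ \bar\alpha_j^{(n)}(t) - \alpha_j(t) = \int_{-1}^{1} K(u) [\alpha_j(t - b_n u) - \alpha_j(t)]\,du. $$

Since $\alpha_j$ is Lipschitz, this implies that

$$ \sup_{t \in [0,1]} |\bar\alpha_j^{(n)}(t) - \alpha_j(t)| = O(b_n) = o(n^{-\frac{1}{2}(1-\beta)}) \tag{4.22} $$

because $\beta > \frac{1}{2}(1-\beta)$ for $\beta > 1/3$. Define

$$ \alpha_j^{*(n)}(t) = \frac{1}{b_n} \int_0^1 K\!\left( \frac{t-s}{b_n} \right) \alpha_j^{\{n\}}(s)\,ds, $$

which we shall use to approximate $\bar\alpha_j^{(n)}$. By (F1), (F3) and Lemma 4.4 (ii), for
$\delta > 1/4$,

$$ \int_0^1 [\alpha_j^{\{n\}}(s) - \alpha_j(s)]^2\,ds = o(1/n) $$

so that, using the Cauchy-Schwarz inequality,

$$ |\alpha_j^{*(n)}(t) - \bar\alpha_j^{(n)}(t)| = \left| \frac{1}{b_n} \int_0^1 K\!\left( \frac{t-s}{b_n} \right) [\alpha_j^{\{n\}}(s) - \alpha_j(s)]\,ds \right| $$

$$ \le \left\{ \frac{1}{b_n^2} \int_0^1 K^2\!\left( \frac{t-s}{b_n} \right) ds \right\}^{1/2} \left\{ \int_0^1 [\alpha_j^{\{n\}}(s) - \alpha_j(s)]^2\,ds \right\}^{1/2} $$

$$ = O(b_n^{-1/2})\, o(n^{-1/2}) = o((n b_n)^{-1/2}). \tag{4.23} $$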

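In view of (4.22) and (4.23), to complete the proof of the theorem it suffices
to show that $\{\sqrt{n b_n}\,(\tilde\alpha_j^{(n)}(t) - \alpha_j^{*(n)}(t)), n \ge 1\}$ is tight for all $t \in (0,1)$. Fix
$t \in (0,1)$. Using Lemma 4.3 we can rework the proof of Theorem 2.1 replacing the
basis $(\Psi_{kr}, r \ge 1)$ by $(\phi_{kr}, r \ge 1)$ and $h_k$ by

$$ h_k^{(n)}(s) = \begin{cases} b_n^{-1/2} K\!\left( \frac{t-s}{b_n} \right) & \text{for } k = j \\ 0 & \text{for } k \ne j \end{cases} $$

for $k = 1, \ldots, p$. Provided we can show that

$$ \sup_{n \ge 1} \sum_{r=1}^{d_n} (h_{jr}^{(n)})^2 < \infty \tag{4.24} $$

where $h_{jr}^{(n)} = \int_0^1 h_j^{(n)}(s) \phi_r(s)\,ds$, it follows that

$$ \sqrt{n b_n}\,(\tilde\alpha_j^{(n)}(t) - \alpha_j^{*(n)}(t)) = \sqrt{n} \int_0^1 h_j^{(n)}(s) (\hat\alpha_j^{(n)}(s) - \alpha_j^{\{n\}}(s))\,ds = U_n + V_n, $$

where

$$ U_n \xrightarrow{P} 0, \quad V_n = \frac{1}{\sqrt{n}} \sum_{i=1}^{n} Z_{ni}, $$

$$ Z_{ni} = \sum_{k=1}^{p} \int_0^1 u_k^{(n)}(s) Y_{ik}(s)\,dM_i(s), $$

$$ u_k^{(n)}(s) = \sum_{r=1}^{d_n} \lambda_{kr}^{(n)} \phi_{kr}(s), $$

$$ \mathrm{vec}(\lambda^{(n)}) = R^{(n)-1}\, \mathrm{vec}(h^{(n)}), $$

and $h^{(n)}$ is the $p \times d_n$ matrix with entries $h_{kr}^{(n)}$. The condition $d_n = o(n^{1/2})$ of
Theorem 2.1 is satisfied here since $\delta < \frac{1}{2}$.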

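Next

$$ E Z_{n1}^2 \le p \sum_{k=1}^{p} E\left( \int_0^1 u_k^{(n)}(s) Y_k(s)\,dM_s \right)^2 $$

$$ = p \sum_{k=1}^{p} E \int_0^1 (u_k^{(n)}(s))^2 Y_k^2(s)\,d\langle M \rangle_s $$

$$ \le C \sum_{k=1}^{p} \int_0^1 (u_k^{(n)}(s))^2\,ds \quad \text{(by (A5))} $$

$$ = C\, \|\mathrm{vec}\, \lambda^{(n)}\|^2 $$

$$ \le C\, \|\mathrm{vec}\, h^{(n)}\|^2, \quad \text{(by Lemma 4.3)} $$

where $C$ is a constant independent of $n$ throughout. Given (4.24), this implies
$\sup_{n \ge 1} E Z_{n1}^2 < \infty$ and that $\{V_n, n \ge 1\}$ is tight. It remains to check (4.24). First
note that the derivative of $h_j^{(n)}$ satisfies the Lipschitz condition

$$ |h_j^{(n)\prime}(s_1) - h_j^{(n)\prime}(s_2)| \le C\, b_n^{-5/2}\, |s_1 - s_2| \tag{4.25} $$

for all $s_1, s_2 \in [0,1]$. Since $K$ has compact support there exist $\epsilon > 0$, $n_0 \ge 1$ such
that the support of $h_j^{(n)}$ is contained in the interval $(\epsilon, 1 - \epsilon)$ for all $n \ge n_0$.
For $n \ge n_0$ extend $h_j^{(n)}$ to a periodic function on the whole real line. This
extension is differentiable and satisfies the Lipschitz condition (4.25) for all
$s_1, s_2 \in \mathbb{R}$. Now applying Lemma 4.4 (ii)

$$ 0 \le \int_0^1 [h_j^{(n)}(s)]^2\,ds - \sum_{r=1}^{d_n} (h_{jr}^{(n)})^2 \le \frac{C_2}{d_n^2} \sup_{|s_1 - s_2| < 1/d_n} |h_j^{(n)\prime}(s_1) - h_j^{(n)\prime}(s_2)|^2 $$

$$ = O\!\left( \frac{1}{b_n^5 d_n^4} \right) = o(1) $$

since $\beta < \frac{4}{5}\delta$. Also

$$ \lim_{n \to \infty} \int_0^1 [h_j^{(n)}(s)]^2\,ds = \int_{-1}^{1} K^2(v)\,dv $$

so that

$$ \lim_{n \to \infty} \sum_{r=1}^{d_n} (h_{jr}^{(n)})^2 = \int_{-1}^{1} K^2(v)\,dv $$

and this proves (4.24). □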


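Proof of Theorem 3.2. Since $K$ is Lipschitz it is of bounded variation. Denote
its total variation by $V(K)$. Write $\bar\alpha_j^{(n)}(t) = b_n^{-1} \int_0^1 K((t-s)/b_n) \alpha_j(s)\,ds$ for the
kernel smoothed version of $\alpha_j$ used in the proof of Theorem 3.1. Then

$$ \sup_{t \in [0,1]} |\tilde\alpha_j^{(n)}(t) - \bar\alpha_j^{(n)}(t)| = \sup_{t \in [0,1]} \left| \frac{1}{b_n} \int_0^1 K\!\left( \frac{t-s}{b_n} \right) d(\hat A_j^{(n)} - A_j)(s) \right| $$

$$ \le \frac{1}{b_n} V(K) \sup_{s \in [0,1]} |\hat A_j^{(n)}(s) - A_j(s)| = O_p(b_n^{-1} n^{-1/2}) \tag{4.26} $$

since $\{\sqrt{n}\,(\hat A_j^{(n)} - A_j), n \ge 1\}$ is tight in $C[0,1]$ by Corollary 2.7. Combining (4.26)
with (4.22) completes the proof of the theorem. □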

Appendix  to  the  proof  of  Theorem  2.1. 

Let $V$ be a Hilbert space and $T$ a bounded linear operator on $V$. Suppose
that $T$ is invertible, i.e. $T^{-1}$ exists and is continuous. We shall need the
following Euler-Knopp series representation for $T^{-1}$. Let $0 < \lambda < \|T\|^{-2}$ and
define

$$ S^{(k)} = \lambda \sum_{i=0}^{k} (I - \lambda T^* T)^i\, T^*, \tag{4.27} $$

where $I$ is the identity operator. Then $S^{(k)}$ converges to $T^{-1}$ in the uniform
topology. From Groetsch (1977, p. 69) an error bound is given by

$$ \|S^{(k)} - T^{-1}\| \le \|T^{-1}\|\, \beta^{k+1}, \tag{4.28} $$

where $0 < \beta < 1$ and

$$ \beta = 1 - \lambda \min\{\|T\|^2, \|T^{-1}\|^{-2}\}. \tag{4.29} $$
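The representation (4.27) and the bound (4.28) are easy to check numerically in finite dimensions. The sketch below does this for a single $2 \times 2$ matrix; it is only an illustration, with hypothetical helper names, an arbitrarily chosen matrix `T`, and the operator norm computed from the largest eigenvalue of $T^*T$ in closed form.

```python
import math

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def transpose(a):
    return [[a[j][i] for j in range(2)] for i in range(2)]

def spectral_norm(a):
    """Operator norm of a 2 x 2 matrix: square root of the top eigenvalue of a'a."""
    g = matmul(transpose(a), a)
    half_tr = 0.5 * (g[0][0] + g[1][1])
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    return math.sqrt(half_tr + math.sqrt(max(half_tr * half_tr - det, 0.0)))

def euler_knopp(t, lam, k):
    """Partial sum lam * sum_{i=0}^{k} (I - lam T'T)^i T'."""
    tt = transpose(t)
    g = matmul(tt, t)
    step = [[(1.0 if i == j else 0.0) - lam * g[i][j] for j in range(2)] for i in range(2)]
    power = [[1.0, 0.0], [0.0, 1.0]]
    total = [[0.0, 0.0], [0.0, 0.0]]
    for _ in range(k + 1):
        term = matmul(power, tt)
        total = [[total[i][j] + lam * term[i][j] for j in range(2)] for i in range(2)]
        power = matmul(power, step)
    return total

T = [[2.0, 1.0], [0.0, 1.0]]          # example operator (arbitrary choice)
T_inv = [[0.5, -0.5], [0.0, 1.0]]     # its exact inverse
lam, k = 0.15, 60                     # lam is below ||T||^-2, roughly 0.19
S = euler_knopp(T, lam, k)
diff = [[S[i][j] - T_inv[i][j] for j in range(2)] for i in range(2)]
error = spectral_norm(diff)
beta = 1.0 - lam * min(spectral_norm(T) ** 2, spectral_norm(T_inv) ** -2)
bound = spectral_norm(T_inv) * beta ** (k + 1)
```

The geometric factor in the bound depends on the operator only through the two norms, which is what allows a single rate to be used for a whole family of operators with uniformly bounded inverses.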


In the following result we shall apply the Euler-Knopp representation to obtain
an approximation to $T^{-1}$ in terms of projections of $T$ onto finite dimensional
subspaces. Let $\{V_n, n \ge 1\}$ be an increasing sequence of finite dimensional subspaces
of $V$ such that $\bigcup_{n \ge 1} V_n$ is dense in $V$. The projection of $V$ onto $V_n$ is denoted
$P_n$. Define the operator $T_n: V_n \to V_n$ by $T_n = P_n T P_n^*$.


Lemma 4.5. Suppose that $T_n$ is invertible for all $n \ge 1$ ($T$ is not assumed to be
invertible here). Then the following statements are equivalent.

(i) $T$ is invertible and $P_n^* T_n^{-1} P_n \to T^{-1}$ in the strong operator topology;

(ii) $\sup_{n \ge 1} \|T_n^{-1}\| < \infty$.

Proof. (i) $\Rightarrow$ (ii) by the principle of uniform boundedness. Conversely, suppose
that $M = \sup_{n \ge 1} \|T_n^{-1}\| < \infty$. Put $I_n = P_n^* P_n$ and note that $I_n$ converges strongly to
$I$. By the definition of $T_n$, $\|I_n T I_n x\| = \|P_n^* T_n P_n x\| \ge M^{-1} \|I_n x\|$ and letting
$n \to \infty$ we obtain $\|Tx\| \ge M^{-1} \|x\|$, for all $x \in V$, so that $T$ is invertible. Let
$0 < \lambda < \|T\|^{-2}$. Then we also have $0 < \lambda < \|T_n\|^{-2}$ for all $n \ge 1$, so we may use
the same $\lambda$ in the Euler-Knopp approximations $S^{(k)}$ and $S_n^{(k)}$ of $T^{-1}$ and $T_n^{-1}$
respectively. It is easily checked from (4.29) that the $\beta$'s corresponding to
$T$ and $T_n$ are bounded above by $\gamma = 1 - \lambda M^{-2}$. It follows from (4.28) applied to
$T$ and $T_n$ that

$$ \|S^{(k)} - T^{-1}\| \le M \gamma^{k+1} \tag{4.30} $$

and

$$ \|S_n^{(k)} - T_n^{-1}\| \le M \gamma^{k+1} $$

for all $n \ge 1$, $k \ge 1$. In particular

$$ \|P_n^* S_n^{(k)} P_n - P_n^* T_n^{-1} P_n\| \le M \gamma^{k+1}, \tag{4.31} $$

for all $n \ge 1$, $k \ge 1$. But, for $k$ fixed, an induction argument shows that $P_n^* S_n^{(k)} P_n$
converges strongly to $S^{(k)}$ as $n \to \infty$. The triangle inequality, (4.30) and (4.31)
then show that $P_n^* T_n^{-1} P_n$ converges strongly to $T^{-1}$. □


Lemma 4.6. Suppose that conditions (A2)-(A4) are satisfied. Then the matrix
$K(t)$ is invertible a.e. ($dt$) and $u_j^{(n)}$ converges to $H_j$ in $L^2[0,1]$ as $n \to \infty$. Here
$K(t)$, $u_j^{(n)}$ and $H_j$ are defined by (2.5), (4.11) and (2.4) respectively.

Proof. Let $V = \bigoplus_{j=1}^{p} L^2([0,1], d\mu_j)$ and $V_n = \bigoplus_{j=1}^{p} \mathrm{span}\{\Psi_{jr}, r = 1, \ldots, d_n\}$. Define
$T: V \to V$ by $T(u)(t) = K(t) u(t)$, $u \in V$, $t \in [0,1]$. The matrix representation of
$T_n = P_n T P_n^*$ with respect to the basis $\{(\Psi_{1r}, 0, \ldots, 0), \ldots, (0, \ldots, 0, \Psi_{pr}), r = 1, \ldots, d_n\}$
is precisely $\zeta^{(n)}$. It follows that $u^{(n)} = P_n^* T_n^{-1} P_n h$, where $u^{(n)} = (u_1^{(n)}, \ldots, u_p^{(n)})'$.
By Lemma 4.2(i), $T_n$ is invertible for all $n \ge 1$ and $\sup_{n \ge 1} \|T_n^{-1}\| < \infty$. Thus, in
view of Lemma 4.5, $T$ is invertible and $u^{(n)} \to T^{-1} h$ in $V$ as $n \to \infty$.
Thus, since the norms in $L^2([0,1], d\mu_j)$ and $L^2([0,1], dt)$ are equivalent under
(A2) and (A3), $K(t)$ is nonsingular a.e. ($dt$), $T^{-1} h = H$ given by (2.4) and
$u_j^{(n)} \to H_j$ in $L^2[0,1]$. □